Pubmed Parser: A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset XML Dataset
نویسندگان
چکیده
منابع مشابه
Schema Based Parallel XML Parser: A Fast XML Parser Designed for Large XML Files
XML is one of the greatest innovations of the digital world. It has taken the field of Web Technology by storm in the past decade and is becoming an ever-present technology in other fields too. XML with its easy usage has lot of future. But the parsing performance of XML is a big hindrance to its development. Particularly, when dealing with huge XML files, normal XML parsers like DOM, SAX parse...
متن کاملSoK: XML Parser Vulnerabilities
The Extensible Markup Language (XML) has become a widely used data structure for web services, SingleSign On, and various desktop applications. The core of the entire XML processing is the XML parser. Attacks on XML parsers, such as the Billion Laughs and the XML External Entity (XXE) Attack are known since 2002. Nevertheless even experienced companies such as Google, and Facebook were recently...
متن کاملTXP: a transaction-based XML parser
The recent emergence of eXtensible Markup Language (XML) as a new standard for data representation and exchange on the World-Wide Web has drawn significant interest in XML parsers. Almost all existing XML parsers are implemented based on either Lex and Yacc clones, or from scratch and in recursive manner. This paper describes a transaction-based XML parser called TXP which is equiped with a lig...
متن کاملNCBI/GenBank BLAST Output XML Parser Tool
We describe a small freely available computer script to extract ‘real world’ sequence descriptions from the BLASTX results from sequences generated by the stand-alone ncbiblast2.2.26 suite of tools (available from NCBI/GenBank). Our Python (2.7) script is intended to make name extraction feasible for thousands, of hundreds of thousands, of sequences such as that generated by BLASTX analysis o...
متن کاملIBP: An Index-Based XML Parser Model
With XML widely used in distributed system, the existing parser models, DOM and SAX, are inefficient and resource intensive for applications with large XML documents. This paper presents an index-based parser model (IBP), which contains validation and non-validation modes, supports nearly all the XML characteristics. IBP has the characters of speediness, robustness and low resource requirement,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Open Source Software
سال: 2020
ISSN: 2475-9066
DOI: 10.21105/joss.01979